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On the Theory of Correlation for any Number of Variables , 
treated by a New System of Notation . 

By G. Udny Yule, Newmarch Lecturer on Statistics, University College, 

London. 

(Communicated by Professor 0. Henrici, F.B.S. Received January 25,— 
Read February 28, 1907.) 

1. The systems of notation hitherto used by writers on the theory of 
correlation are somewhat unsatisfactory when many variables are involved. 
In the present paper a new notation is proposed which is simple, definite, and 
quite general, thus very greatly facilitating the treatment of the subject. The 
majority of the results given in the sequel were, in fact, first suggested by 
the notation itself. 

2. Let xix 2 ...x n denote deviations in the values of the n variables from 
their respective arithmetic means. Then the regression equation may be 
written 

Xi = &12.34...»^2 + &13*24...»^3 + • • • + &ln-23...»-l^n. (1) 

In this notation the suffix of each regression coefficient completely defines 
it. The first subscript gives the dependent variable, the second the variable 
of which the given regression is the coefficient, and the subscripts after the 
period show the remaining independent variables which enter into the 
equation. It is convenient to distinguish the subscripts before and after the 
period as “ primary ” and “ secondary ” subscripts respectively. The order in 
which the secondary subscripts are arranged is indifferent, but the order of the 
two primary subscripts is material; e.g., bi 2 .$... n and 621.3... »■ denote two quite 
distinct coefficients. A coefficient with p secondary subscripts may be termed 
a regression of the^th order, the total regressions 6 i2 , 613, 623, etc., being thus 
regarded as of order zero. 

3. The correlation-coefficients may be distinguished by subscripts in 
precisely the same manner. Thus the correlation ...» is defined by the 
relation 

^12‘34...n = (&12-34...n • ^21-34...n)^. (2) 

In the case of the correlations, the order of both primary and secondary 
subscripts is indifferent. A correlation with p secondary subscripts may be 
termed a correlation of order p, the total correlations r 12 , r 13 , r 23 , etc., being 
regarded as of order zero. 

4. If the regressions in equation (1) be determined as usual by the method 
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Theory of Correlation for any Number of Variables, etc. 183 

of least squares, the difference between #1 and the expression on the right, for 
any observed set of values of #i# 2 ...# n , may be denoted by %i. 2 s— n • that is 

#1-23...» = #1 — — &13-24...n#3 ~ ~ &ln.23...n-l#n* (3) 

Such a residual, or deviation, denoted by a symbol with p secondary 
subscripts may be termed a deviation of the pth order, #i# 2 ...# w being regarded 
as deviations of order zero. 

5. Finally, the standard deviation 0 - 1 . 23 ...» is defined as given by the relation 

N" • C 2 l-23 ...n = 2 (aJ 2 l.23...»)> (4) 

N being the number of observations. If the standard deviation be denoted 
by a symbol with p secondary subscripts, it is of the ^?th order, the total 
standard deviations being regarded as of order zero. 

6 . In terms of this notation, the normal equations from which the regressions 
are determined may be very briefly written, in the form 

2 (#2 • #1*23...n) == 2 (#3 . #1.23...n) = • • • 

= 2 (X n . #i.23...n) = 0. (5) 

That is to say, we have the general theorem: “ The product-sum of any 
deviation of order zero with any deviation of higher order is zero, provided 
the subscript of the former occur amongst the secondary subscripts of the 
latter.” 

7. It follows that the product-sum of any two deviations of the same order, 
with the same secondary suffixes, is unaltered by omitting any or all of the 
secondary subscripts of either and, conversely, the product-sum of any 
deviation of order p with a deviation of order p + q, the p subscripts being the 
•same in each case, is unaltered by adding to the secondary subscripts of the 
former any or all of the q additional subscripts of the latter, for we have 
by § 6 

2 (#1.34...n#2-34...w) = 2 (#1.34.. .n) (#2 — ^23-4...n#3— • • • —^2»-3...w-l#w) 

= 2 (#1.34„.n#2). 

Similarly 

2 (#1.34...w#2.34...n—l) ==: 2 (#1.34...w#2)> 

and so on. Therefore, quite generally, 

2 (#1.34...n#2.34...n) ==: 2 (#1.34...w#2‘34...w— l) = ••• 

... =2 (#1.34...w#2). (6) 

8 . It follows from § 7 as a corollary from § 6 that the product-sum of any 
two deviations is zero if all the subscripts of the one are contained among the 
;secondary subscripts of the other. 

These theorems (§§ 6 — 8 ) give the key to simple deductions of many 
results in the theory of multiple correlation. 
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9 . We have from the last section and § 7 , 

0 = S (^2-34...w^l-234...n) 


= S (% 2 .M...n) (^1 — ^12.34...n^2— terms in x 3 to x n ) 

= ^ {X\X 2 . 34... n) ^12*34 ...n^ (#?2‘^2‘34...w) 

= S (^1-34...w^2-34...n) ~ ^12*34 ...rX (^ 2 2-34...w)- 


That is 



a> 


But this is the value that would have been obtained by taking a regression 
equation of the form 

^1-34-n = b\ 2 . 3 ^... n X 2 . 3 /^... n) 


and determining b 12 . 3 ^.. n by the method of least squares. That is to say,. 
&i2.34-?i may be regarded, quite generally and without any reference to the 
form of the frequency distribution, as the regression of Xi.u... n on ^2-34-n- It' 
follows at once from the definition ( 3 ) that ri 2 . 3 ±... n may be regarded as the 
correlation between ^i.34... n and £2.34...^ and from ( 4 ) that we may write 



( 8 > 


All the relations, in fact, that hold good between deviation-sums, standard 
deviations, regressions and correlations of order zero, are also valid between* 
deviation-sums, standard deviations, regressions and correlations of any high 
order. 

10 . This result is of some importance as regards the interpretation of partial 
correlations and regressions. In the case of normal correlation there is no 
difficulty in assigning a meaning to these constants, as the regression is. 
strictly linear, and the partial correlations and regressions are the same for all 
types of the variables. But in the general case this is not so, and although I 
showed, in a previous discussion of the question,* that the values assigned ta* 
the partial regressions on the assumption of normal correlation are the “ least 
square” values and, consequently, that the partial correlation retains an 
“ average significance,” I could not prove that it remains an actual correlation 
between determinate variables. The above theorem completes the work in this, 
respect. If, with three variables x 1} x 2) and x 3) for example, the two regressions. 
b 13 and b 23 be determined in the ordinary way, and then the residuals 
x h3 = — b 13 x 3) x 2 . 3 = x 2 — b 23 x 3 be calculated for all sets of observations- 

x\x' 2 x' 3) x'\x" 2 x" 3) etc., the correlation between x \. 3 and x 2 . 3 is r 12 . 3 . A 
similar interpretation holds for any greater number of variables. 

* 4 Roy. Soc. Proc., J vol. 60 (1897), p. 477 ,* 4 Roy. Stat. Soc. Journ., 5 vol. 60 (1897),, 

p. 812. 
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1907.] Variables , treated by a New System of Notation . 185 

Such a relation would not, of course, afford a practical method of calculating 
the partial coefficients, as the arithmetic would he extremely lengthy. 

11. Any standard deviation of order p may be expressed in terms of a 
standard deviation of order p — 1 and a correlation of order p — 1 . For we 
have, using the theorems of §§ 6 and 7, 

S(^ 2 l-23 ...n) = £ (#1.23...n-l^l-23.,.n) 

= S(^i. 23 ...w-i)(^i—^Iw. 23 ...»- 1 ^»—terms in x 2 to x n -i) 

= ^ (*^ 2 1.23...»— l) ^ln*23...n—(^1*23..l^»*23...»—l) j 

or, dividing through by the number of observations, 

G* 2 l-23..•» = cr 2 l-23—»-l (l““^l»*23—»-l^»l-23—w-l) 

= 0" 2 l-23—n-1 (1— ^ 2 ln-23-w-l)- (9) 

The form of this relation is the same as that of the familiar relation 
between a standard deviation of the first order and a standard deviation of 
order zero, with the secondary subscripts 23 .-»_i added throughout. It is clear 
from (9) that ri n . 2 s... n -i cannot be numerically greater than unity. It also 
follows at once that if we have been estimating x 1 from x 2 , x s ... x n _ h x n will 
not increase the accuracy of estimate unless ri w . 23 ... w _i (not r Xn ) differ from 
zero.* 

12 . In equation (9) the subscript n is eliminated from the suffix of 0 - 1 . 23 ... w , 
and it is obvious that any other subscript can be eliminated in the same way. 
Therefore we must have 

(1— ^ 2 12-34-n) == CT 2 l-24-» (1 “^ 2 13-24-.-n) 

= ... = 0- 2 i. 23 ... n -i (1—r 2 i n .23 ..n-l)- (10) 

Further, we have 

0' 2 l-23-»-l = cr 2 l-23—»-2(l—^(n-l) • 23— n- 2 ) 

0" 2 1.23-n-2 = 0- 2 l-23—n-3 (1— ? ,2 1 ( n -2) • 23-n-3)> 

and so on; so that 

<r 2 i.23...n — o* 2 ! (1—r 2 i 2 ) (1—r 2 ] 3 . 2 ) (1 —r 2 i 4 . 23 ).. .(1 — r 2 i n . 23 ... n _i). (11) 

This is an extremely convenient expression for arithmetical use, as 
illustrated later. A complete check on the arithmetic is obtained by 
eliminating the secondary subscripts in a different, say the inverse, order, i,e., 
by using the result— 

<r\. 2 z-n — <r 2 i (l—r 2 ln ) (1—r 2 i (w _i). n ) (1 — r\ (w _ 2) . w (w -d). . .(1 — r 2 12 . 34 ... n ). (12) 

* Cf, proofs for cases of 3 and 4 variables previously given (loo. cit . in previous note). 
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13. Any regression of order p may be expressed in terms of regressions 
of order p — 1. For we have 

^ (^l-34—w^2-34—w) = ^ (^l-34---w—1^2-34—w) 

= 55 (^i.34...w-i) (^2—& 2 w. 34 ...w-i^ —terms in x 3 to x »-i) 

= S (^l-34—w-1^2-34—w-l)-—^2w-34—w-lS (^1.34—»-l^»*34-»-l)* 

That is, replacing & 2w . 3 4 -w-i by & w2 . 34 -w-i x cr 2 2 .M... n -i/(r 2 n-u-n-i, 

&12-34-»0' 2 2.34..-» = &12-34—w-1 — &lw-34—w-1&W2-34-W- lCT 2 2-34— n~- !• 


Therefore, by equation (9), 


b 


12-34— n 


&12-34—w-1— frltt-34—n—1 - &W2-34—w-1 

1 — &lw-34— n- l^wl-34— n -1 


(13) 


But this is simply the expression for b\ 2 . n in terms of &i 2 , bi n , b n i, and & n2 , with 
the subscripts 3 4 ... w -i added throughout. Therefore & 12 . 34 -W may be regarded 
as the partial regression of ^. 34 -n-i on £ 2 . 34 -w-:b ^n- 34 -n-i being given. As 
any other secondary subscript might have been eliminated in lieu of n , we 
can also regard it as the partial regression of aq. 45 ... n , on & 2 . 45 ... w , ^ 3 - 45 »-« being 
given, and so on. 

14. Equation (13) may be written in terms of the correlations:— 


^12-34— n 


J- — • lw-34—w-l °2-34 —n 

Hence, writing down the similar expression for & 2 i. 3 4 ... w , 
square root of the product, 

_ ^12-34--»— 1 ^l»-34—n— 1^2»-34--n— 1 

12*34"*n - (1 _ r 2 ln ^ n _ i)h (1-^34...,-^ * 


and taking the 
(14) 


This is, similarly, the expression for r^n in terms of r\ 2 , and r 2n , with 
the secondary subscripts 34 ... w -i added throughout, and accordingly r 12 . 34 -.-w 
may be regarded as the partial correlation between Xi.^—n -1 and a? 2 . 34 .-w-i> 
^n- 34 -w-i being given, and so on, as for the regression. 

15. It is clear that equations (13) and (14) imply a series of relations 
between correlations or regressions of orders less than n — 2 with n variables, 
for all the expressions obtained by eliminating 34 ... w in turn from the 
secondary subscripts of the constant on the left must be equal to each other. 
Further, every coefficient of the pth order can be expressed in terms of the 
coefficients of the (p — l)th order in p different ways, by eliminating each of 
the p secondary subscripts in turn. This enables an absolute check to be 
kept on the arithmetic by calculating each coefficient in at least two distinct 
ways. 

16. By the use of equation (14), the work of calculating correlation 
coefficients of higher orders is rendered quite simple and straightforward. 
The use of equation (13) for calculating the regressions is comparatively 
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1907.] Variables , treated by a New System of Notation . 187 


clumsy, however: when the correlations have been found, it is best to work 
out the standard deviations by equation (11), and then the regressions are 
given at once by (8). The following data, taken from a discussion of 
pauperism,* will serve as an arithmetical illustration, the variables being 
the percentage changes during a decade in the poor-law unions of England in: 
(1) the percentage of the population in receipt of poor-law relief; (2) the 
ratio of the numbers given relief out-doors to one indoors (in the workhouse); 
(3) the proportion of aged (over 60) in the population; (4) the population 
itself. The values of the correlations of order zero are given in Table I, and 
the logarithms of (1 — r 2 )^, required in the calculations, are entered in the 
third column. These coefficients are next grouped in sets of three, one set to 
each possible group of three variables, as in the second column of Table II, 
and the coefficients of the first order are then calculated from (14). For 
convenience in calculating the coefficients of the second order, the values of 
log (1 — r 2 )* are again entered in the last column. 

Table I. 


Correlation 

coefficient. 

log Vl — r 2 . 

12 

+ 0*52 

1 *93154 

13 

+ 0*41 

1 *96003 

14 

-0*14 

1 *99570 

23 

+ 0*49 

1 *94038 

24 

+ 0*23 

1*98820 

34 

+ 0*25 

1 *98598 


Table II. 


Correlation 
coefficient 
(zero order). 

Product 
term of 
numerator. 

Numerator. 

Correlation 
coefficient 
(first order). 

log a/1 — r 2 . 

12 

+ 0*52 

+ 0 *2009 

+ 0*3191 

12 *3 

+ 0 *4014 

1*92370 

13 

+ 0*41 

+ 0*2548 

+ 0*1552 

13 *2 

+ 0*2084 

1*98071 

23 

+ 0*49 

+ 0*2132 

+ 0*2768 

23 *1 

+ 0*3553 

1*94139 

12 

+ 0*52 

-0 *0322 

+ 0*5522 

12 *4 

+ 0*5731 

1*82709 

14 

-0*14 

+ 0*1196 

-0*2596 

14*2 

-0*3123 

1 *95544 

24 

+ 0*23 

-0 *0728 

+ 0*3028 

24 *1 

+ 0 *3580 

1 *94044 

13 

+ 0*41 

— 0 *0350 

+ 0*4450 

13 *4 

+ 0*4642 

1*89460 

14 

-0*14 

+ 0*1025 

-0 *2425 

14 *3 

-0 *2746 

1 *96595 

34 

+ 0*25 

-0 *0574 

+ 0*3074 

34 *1 

+ 0 *3404 

1*94652 

23 

+ 0*49 

+ 0*0575 

+ 0*4325 

23 *4 

+ 0*4590 

1*89726 

24 

+ 0*23 

+ 0*1225 

+ 0*1075 

24 *3 

+ 0*1274 

1*99290 

34 

+ 0*25 

+ 0*1127 

+ 0*1373 

34 *2 

+ 0*1618 

1*98848 


* ‘ Roy. Stat. Soc. Journ.,’ vol. 62 (1899), p. 249. 




















Downloaded from rspa.royalsocietypublishing.org 


o 
z 
oc 


2<Z}U 


^u 
Sq.o3i/> 



Ss 


w u 


Pdo 

H m 



_r a 
< z 

y 5 

b m 
2<zm 

^S=!v5^ 

h^zS 

5>ui- 

2i v u 
S0.08 m 



Ss 


w U 

Ko 

H ^ 



188 Mr. G. U. Yule. Theory of Correlation for [Jan. 25, 


Table III. 


Correlation 
coefficient 
(first order). 

Product 
term of 
numerator. 

Numerator. 

Correlation 
coefficient 
(second order). 

log VT — r 2 . 

12 *4 

+ 0*5731 

+ 0-2131 

+ 0-3600 

12 -34 

+ 0-458 

1 -89774 

13 *4 

+ 0*4642 

+ 0*2630 

+ 0-2012 

13 -24 

+ 0-276 

1*96559 

23 *4 

+ 0-4590 

+ 0-2660 

+ 0-1930 

23 -14 

+ 0-266 

1-96814 

12 *3 

+ 0-4014 

-0 *0350 

+ 0*4364 

12 -34 

+ 0-458 

_ 

14 *3 

-0 -2746 

+ 0-0511 

-0*3257 

14 -23 

-0*359 

1 -94007 

24 3 

+ 0-1274 

-9 *1102 

+ 0*2376 

24 -13 

+ 0-270 

1 -96713 

13 *2 

+ 0*2084 

-0 *0505 

+ 0-2589 

13 -24 

+ 0 *276 

_ 

14 -2 

-0-3123 

+ 0*0337 

-0 -3460 

14-23 

-0-359 

— 

34 -2 

+ 0-1618 

-0 -0651 

+ 0*2269 

34 -12 

+ 0-244 

1 -97333 

23 *1 

+ 0*3553 

+0 -1219 

+ 0*2334 

23 -14 

+ 0-266 

_ 

24 -1 

+ 0 *3580 

+ 0-1209 

+ 0-2371 

24 -13 

+ 0-270 

— 

34 -1 

+ 0-3404 

+ 0-1272 

+ 0-2132 

34 -12 

+ 0-244 

— 


The first order coefficients, from Table II, are then regrouped according to 
the same primary subscripts as in Table I, and the work repeated precisely as 
before, as in Table III, but each coefficient of the second order is automatically 
calculated by this process in two ways and the work thus checked. Small 
errors introduced by the non-retention of insignificant figures may, of course, 
prevent complete agreement to the last place of decimals, and for this reason 
the coefficients of the first order were evaluated to four figures, although 
only three were required for the final result. In order to obtain the regression 
equation between changes in pauperism and changes in the three remaining 
variables, we require the three regressions £>12.34, £>13*24, and £>14.23 and, 
accordingly, must obtain the six standard deviations, <71.34, <72.34, 01*24, 0-3.24, 

01*23, 04*23* 

These are readily calculated and checked by means of the equations of the 
form— 

01.34 = 01 (1 — ^13)* (1 ~ ^14.3)^ 

•■= 0-1 ( 1 —^14)* ( 1 —^18.4)* 


given <71 = 29*2, <7 2 = 41*7, <73 = 5*5, <74 = 23*8; and the values found are :— 
< 71.34 = 25*61, 0*1.24 =: 27*63, 01*23 = 24*39, 

02*34 == 36*06, 03.24 — 4*73, 04.23 22*86. 

Hence, from the equations of the form 


£>12*34 = ^12*3 


we have 


01*34 

L-, 

02*34 


£> 12.34 = +0*325, £>13.24 = +1*383, £>14.23 = —0*383. 
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That is, the regression equation between changes in pauperism and changes 
in the other factors considered is 

X! = 0*325^2+1*383^3—0*383^4- 

To complete the work, we may calculate 0 - 1 . 234 , the standard error made in 
estimating x\ from x 2 , x s , and x 4 by the above equation. The value is 

0-1-234 = 0-l(l~-r 3 i2)^(l— ^ 2 13-2)Hl‘“ /,2 14-23)^ 

= 0-1 ( 1 — r 2 u)% (1 — r 2 13 . 4 )* (1 — r 2 !2.34^ 

= 22 * 8 . 

17. If, in accordance with the notation used for elementary cases in the 
paper already referred to, and that in a recent note by Mr. E. H. Hooker and 
myself,* we write 

<r 2 l-23-.-n = 0 - 2 l (1 — E 2 i( 23 ...n)), (15) 

Ei( 23 ...») may be regarded,, as a coefficient of correlation between x\ and the 
expression 

01*23 -n = &12.34— n%2 + &13-24.-n^3 + • •. + &ln-23-n-(16) 

The value of E is accordingly a useful datum, as indicating how closely x\ 
can be expressed in terms of a linear function of x 2 x$ ... x n . It may be readily 
calculated either direct from the equation 

1 —E 2 i(23-..n) = (1 — r 2 i 2 ) (1 —r 2 i 3 . 2 ).. .(1 -r 2 u .23... w ~-i), (17) 

or from the value of 0 - 1 . 23 .-n and 0 - 1 , if previously obtained. 

It is obvious from (17) that, since every bracket on the right is not greater 
than unity, 

1 — E 2 i(23-.n) = 1 — T 2 \ 2 . 

Hence Ei( 2 3 ...n) cannot be numerically less than r i2 . For the same reason, 
rewriting (17) in every possible form, Ei( 23 -n) cannot be numerically less than 
ru, riz ,... r in , i.e. 9 any one of the possible constituent coefficients of order 
zero. Further, for similar reasons, Ei( 23 ... n ) cannot be numerically less than 
any possible constituent coefficient of any higher order. That is to say, 
Ei( 23 -.») is not less than the greatest of all the possible constituent coefficients 
of all orders, and is usually, though not always, markedly greater. Thus in 
the illustration of § 16, the value of Ei.( 2 34 ) is 0*626, and the greatest correlation 
coefficient is 7 * 12.34 = 0*458. The sign of E is necessarily positive, for a 
positive increment in x\ obviously corresponds on the average to a positive 
increment in 01 . 23 .-n. More definitely, the standard deviation of 61 . 23 .-n is 
0 "iEi( 23 ...n), and the regression of x\ on 61 . 23 .-n is therefore -f 1 . 

Seeing that 0 - 2 i. 23 -n = cr 2 i(l — E 2 i( 2 3 ».n)), and that 0 - 1 . 23 .-n is a minimum, we 
may, alternatively, regard the values of the regressions as determined by the 
* ‘Eoy. Stat. Soc. Journal,’ vol. 59 (1906), p. 197. 
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condition that the correlation between x\ and 01 . 23 ...*, viz., Ri( 23 ...«), shall he a 
maximum. 

18. It is obvious that equations (13) and (14) imply relations of an inverse 
kind, expressing coefficients of a lower order in terms of those of a higher 
order. Using the same method of expansion as in previous cases, we have 

0 = 2 (^1.23-n^2-34»*w~l) 

= 2 (^1^2-34—ft-1)— &12-34— m 2 (#2^2-34—ft-l) 


^lft-23—w—12 (#?n#?2-34—ft-—l)* 

That is 


&12-34— n-1 — 612 . 34 ... n + ^lw.23—»-l&«2-34—ft-l- 

But by interchanging the suffixes, viz., 1 for n and n for 1, 

^ft2-34-ft-l = &ft2-13—ft-l+ ^ftl-23—ft-1^12-34—ft-l- 

Substituting this value of <^ 2 - 34 —ft -1 in the first equation and simplifying, 


&12-34-M-1 


^12*34—ft + ^lft-23-»-l^ft2-13—ft~l 


l-h 


'lft-23-ft-l^ftl-23-n-1 


(18) 


This is the required equation for the regressions. The similar equation for 
the correlations is obtained at once by writing down the corresponding 
expression for & 2 i- 34 -ft-i and taking the square root 


^12-34—ft-1 


ri2.34-ft-bri^.23... w ~ir2 ft .i3-ft-l 
(1 ~~^ 2 lft-23—ft-l)^ (1 — rVl3...ft-l)* * 


(19) 


19. The general principle that any equation subsisting between such 
statistical constants as correlations, regressions, and standard deviations 
holds good for all secondary subscripts, applies also to the equation ( 3 ), which 
expresses the. individual deviation of order p in terms of deviations of order 
zero. That is to say, we have, quite generally, k being any subscript or 
collection of subscripts, 

$U2-&ft = #1.*— &12-3-Aft^2*A— • • * — hn-2~'k(n--l)%n-k' (20) 

For, if l be any one of the subscripts included under h , and if m denote the 
remaining subscripts, on expanding both sides of the equation in terms of 
deviations of order zero, the coefficients of x h X2, .. .x n are the same. The 
coefficients of xi are equal if 


lZ-2-THft = hl-m—hbm • &12-3-*ft— - • * — Kbm * bln-2—k(n-~ 1)- 

hbm, **-b n bm by product sums, this 


But, replacing the regressions 
reduces to 

2 (pbm * ^l-2--Zmft) ^ 0, 
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which is true by § 8, whether m denote a single subscript or an aggregate, or 
is absent, and equation (20) is accordingly correct. Remembering that 


£ _ G'1‘3 —kn 

0\2-Z-kn — ?12-3 •••kn J - — 


^12*3 -kn 


0"l-23 —kn 


CT2-3—kn CT2-13-kn 

the equation may also be written in the useful form- 


Xi. k 


X\-2—kn _ 

Cl -2—kn <Tl‘2—kn 


■^12-3-An- 


%2-k 


<T2-13—kn 


“^ln-2 —k (n-l)~ 


Xjfk 


<T n -12—k(n-l) 


( 21 ) 


20. In all the preceding sections no assumption of any kind has been 
made with respect to the form of the distribution of frequency, but the 
results may, of course, be applied to the special case of the normal dis¬ 
tribution. Let yi 2 -.. n denote the value of the normal function for the 
combination of deviations xi, x 2 , .. .x n) and y' i 2 ... n the value of the function 
when all deviations are zero, then we may write 

y\ 2 -n = y' 12...* . exp — (X\X 2 ,. .Xn), (22) 

the form of the function <£ being determined by the fact that the distribution 
of every array must be normal, and that the mean of the array of any one 
variable associated with given types of the others must be the linear function 
of those types given by the general regression equation of the form (1). We 
must have, accordingly 

X x2 l t x2 2 , , X 2 n 

9 = o--—2- 

<T 1.23 —n & 2-13—n . <T n-12— (n-1) 

O/m X\X 2 c) X n —\X n YOQ\ 

— ^12-3-n-- ... Ll (n—1) n*12—(n—2)-• \^o) 

<Tl-23-n<r 2-13-n <r ( n - l)-l-(n-2)CT n .i...( M -i) 


But this expression may be thrown into several different forms. Thus, 
replacing the correlated variables, x h x 2} ...x n) by the independent variables, 
x ly x 2 . h x 3 . 12 , ...x n .we have the very useful form 




, Ai2 . X 2 n .i...( n -1) 

o * o * • * * * 2 * 

2 -i 3.12 cr M .i...( n -i) 


(24) 


This expression may be shown to be identical with (23) by expanding in 
terms of deviations of order zero, and reducing the coefficients of the square 
terms by means of the equation 

rr 2 7 n 2 rr 2 , ’ 
v n>k 0 n 0 n-k 

and those of the product terms by an equation derived at once from (19), 
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21. Several important results follow at once from the form of the expression 
(24) for the exponent <p. Since the variables are independent, the central 
value of the normal function, y' i 2 ... n , must be given by the product of the 
well-known expressions for the single variables, i.e., we must have 

y'l 2 -n = N/( 27 r)^Vi< 7240 ' 3 U 2 ...^nU...(n-l). (25) 

22. Again, if we integrate the normal function in the form (24) with 
respect to x x , treating the remaining variables, x 2 . x , x%. 2Xi etc., as constants of 
integration, a x is eliminated from y ' and x x from </>, and all the remaining 
variables in the exponent contain the secondary suffix 1. If x 2 .i, x 3 . 21 , 
...# n .i...( n -i) are then replaced by x 2 . h x 3 . X) ...x n . h <£ may be written in the 
form (23) for these variables. Similarly, if we integrate again with respect to 
x 2 . Xy 0 - 2.1 is removed from y' and x 2 . x from <f> : and all the remaining variables 
in the exponent contain both secondary suffixes 1 and 2. If xs. 2 i, £ 4 . 321 , 
.. -x n -321 are then replaced by # 3 . 21 , £ 4 . 21 , •. .£ n - 2 i, $ may be written in the form (23) 
for these variables. Clearly the process may be continued on the same lines. 
The correlation between all sets of deviations, of any one order, with the same 
secondary suffixes, is therefore normal correlation. 

23. It follows that we may generalise at once the known formulae for the 
probable errors of the constants of a normal distribution. Omitting the 
factor 0*674489...we have, standard error of a 


Standard deviation oq . <n/ s /2'N. 

Correlation coefficient r X2 . 1 —r 2 i 2 /fN. 

Regression coefficient b X2 . <ri. 2 f<T 2 ^]$. 


The first is a well-known result; the last two are cited from the valuable 
memoir by Professor Karl Pearson and Mr. L. K. 0. Filon.* But since a\. k is 
the standard deviation of the normally distributed variable x\.k, r X2 . k the 
■correlation between the normally distributed variables x x .u and x%. k , and bi 2 . k 
the regression of x x . k on % A ,£we must have, quite generally, k denoting as 
before either a single subscript or an aggregate, standard error of a 

Standard deviation < 71 .* .. <n. k ff2'N. 

Correlation coefficient vi 2 . k . 1 —T\ 2 . k l 

Regression coefficient b X2 . k . 0 * 12 .*/ <r 2 , k \/^- (26) 

The last result may be readily verified against the formula arrived at by 
Professor Pearson and Mr. Filon, for the case of three variables, after pages of 
the most laborious work.f The first may be checked for the case of two 

* ‘ Phil. Trans.,’ A (1898), vol. 191, p. 229. 
t Loc. cit., equation xxxviii, p. 260. 
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variables, remembering the result of the same writers,* that the correlation 
between errors in a\ and in r i2 is r^j^/2 ; for we have 

o-j .2 = <n(l — r 2 12 )*; 

d<r i «2 clcr i T \ 2 . dr\ 2 

a 1-2 d 1— r\ 2 ' 

Or, squaring both sides of the equation and summing, using € 1.2 to denote the 
standard error of cri. 2 , 

e\ 2 _ 1 r 2 12 _ 2r 12 r 12 1 l—r\ 2 ___ 1 

(T 2 1.2 2If ¥ l-r\ 2 </2 v/2N ■ 22T' 

23. The question of errors of sampling in the case of the coefficient of %-fold 
correlation, R, is not so simple, owing to the fact that the sign of the 
coefficient is essentially positive and, consequently, it is subject to biassed 
error. If, for instance, a series of variables are strictly independent, but 
values are found for r 12 , r\$. 2i 7 * 14 . 32 , etc., equal to 8 2> 8 3 , S 4 ,...then 

1-R 2 1<23...») = (1-S 2 2)(l-S 2 3)...(l-S 2 n). 

If the 8’s are sufficiently small to enable us to neglect terms of the fourth 
order as compared with those of the second order, then we may write to the 
first approximation, 

R 2 i ( 23 - *n) = 8 2 2 4 - 8% +... + S 2 n . 

Or, summing for a number of samplings and substituting 1/N for 2($ 2 ) in 
each case, the root-mean-square value of R when the variables are strictly 
independent is 

r 0 = (7i~i)vm ( 27 ) 

n being the number of variables and NT the number of observations. R 
cannot be held with certainty to be of definite significance if not markedly 
greater than this, and if the number of observations be small compared with 
the number of variables, the critical value is rather unpleasantly large. Thus 
in the case of a recent investigation by Mr. R. H. Hooker into the relation 
between the weather and the crops, n = 3, IT = 21, consequently 
R 0 = Y^-rr = 0*31 (the value cited by him on my authority).! Clearly, if the 
number of observations be small, it is not worth while dealing with a large 
number of variables. 

* Loc. cit ., equation xviii, p. 242. 
f ‘Roy. Stat. Soc. Journ.,’ vol. 70 (1907), p. 7. 













